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1^ , Signaling is an important topic in the study of asymmetric information in economic settings. 

! In particular, the transparency of information available to a seller in an auction setting is a 

question of major interest. We introduce the study of signaling when conducting a second price 
f-H ■ auction of a probabilistic good whose actual instantiation is known to the auctioneer but not 

i to the bidders. This framework can be used to model impressions selling in display advertising. 

(/3 , We establish several results within this framework. First, we study the problem of computing a 

signaling scheme that maximizes the auctioneer's revenue in a Bayesian setting. We show that 
this problem is polynomially solvable for some interesting special cases, but computationally 
^ ■ hard in general. Second, we establish a tight bound on the minimum number of signals required 

. to implement an optimal signaling scheme. Finally, we show that at least half of the maximum 

, social welfare can be preserved within such a scheme. 
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1 Introduction 



A major concern in market design is to ensure that the markets are thick in the sense that there 
is a sufficient volume of participants to produce the necessary level of competition for the market 
to work well. Another concern is to design a practical language that is sufficiently expressive to 
allow players to specify how much they value the goods in the market. In the market for diamonds 
described by Levin and Milgrom [10] for example, the auctioneers could elicit bids for each individual 
stone. However, the enormous effort required for the players to learn the value of each individual 
stone and to submit individual bids would make the auction impractical. Moreover, bidding on 
each stone separately can lead to the cherry-picking phenomenon, where very few customers are 
interested in any one stone. This may lead to a situation where little revenue is generated although 
the goods are valuable. In practice, stones are categorized into deals and then auctioned. This 
method of treating different goods as identical is called conflation. 

Milgrom [llj and Levin and Milgrom [10] provide a comprehensive analysis of the phenomenon 
of conflation in various markets, with particular emphasis on online advertisement. In these mar- 
kets, auctioning each good individually is usually not an option and conflation must be used. One 
particular online market that we will focus on is the multibillion-dollar display advertisement mar- 
ket, where publishers (such as MSN and Yahoo) attempt to maximize the revenue they collect 
from the advertisers (say, Nike or Coca-Cola) for wisely targeting their ads at the right users. For 
example, an ad referring to the surfing lifestyle on the sunny beaches of the Pacific Ocean may be 
most valuable when targeted at a teenager from California; perhaps less so when targeted at a 10 
year old from Oregon; and even less when targeted at older folks in areas that are far from the 
ocean. However, it would be impossible for advertisers to decide how to bid on each individual 
impression. Instead, the impressions are categorized based on attributes such as the time when the 
impression was made, cookies in the user's browser, certain demographic properties, geographic 
location, etc.; impressions with similar attributes are then treated as instances of the same good. 

In the context of display advertisement, the main question we deal with is: How should those 
impressions be categorized in order to maximize the publisher's revenue? The high-level idea behind 
our model is to explore the natural asymmetry of information between the publisher and the adver- 
tisers: while advertisers may know the distribution of users visiting a particular site, the publisher 
usually has much more accurate information about each individual impression^ Upon receiving an 
impression, the publisher may choose to reveal to the advertisers certain attributes of this impres- 
sion (say, age and gender), while concealing other attributes (say, geographic location). One may 
argue that concealing information from the advertisers might generate inefficiencies in the market, 
but the amount of information is typically so large that it would be impossible for the advertisers to 
grasp everything anyhow. More importantly, ensuring that revenue is generated is essential for the 

^ In reality, the additional information about the visitor to a site is often handled by third party demand side 
platform (usually refered as DSP) . For simplicity, we abstract away this distinction. 
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proper functioning of markets. As described by Muthukrishnan |14j . in Ad Exchanges, which are 
systems that bring together pubhshers and advertisers in a common marketplace, ensuring good 
revenue is vital to keeping publishers in the market. 

More generally, our goal in this paper is to cope with the undesired effects of competition 
deficiency on some items in an auction. To achieve this goal, we will exploit an inherent information 
asymmetry between the auctioneer and the bidders that exists in many market settings. We mode|§ 
the auctioneer /bidders asymmetry by considering a framework termed a probabilistic single-item 
auction, in which n bidders participate in an auction for a single item, which is chosen randomly 
from a set of m indivisible goods according to a commonly known probability distribution p G A(m). 
In contrast to the bidders, who know only the probability distribution over the possible goods, the 
auctioneer knows its actual realization, and can use this informational superiority to increase the 
collected revenue. 

Specifically, the auctioneer may choose to reveal partial information to the bidders by means of 
a signaling scheme. A signaling scheme is a (possibly randomized) policy that specifies some signal 
a revealed to the bidders upon the choice (made by nature) of each good j G [m]. This policy 
is known to the bidders who can therefore induce the revealed signal a to update their perceived 
probability for the chosen good j from p{j) to the "more accurate" p{j \ a). 

One approach would be design an optimal auction from scratch for the problem of maximizing 
revenue in a probabilistic single item auction setting. Since we are in a setting with very correlated 
values, this is likely to produce contorted and impractical auctions in the style of the auction of 
Cremer and McLean [21 [3], where full surplus extraction is possible. Instead, we focus on the 
standard second price auction mechanism which is the de-facto standard for the sale of online 
advertisement |22l \S\ |T1] . We believe this will generate an auction that is more relevant to practical 
applications and that can be easily integrated with the current implementations. 

In this auction, after the bidders receive the signal, they submit their bids, and the winner and 
the payment are determined according to the second-price auction; namely, the winner is the bidder 
with the highest bid and the payment is the second-highest bid. The goal of the auctioneer, which 
is the subject of this paper, is to design a signaling scheme that maximizes her expected revenue. 

A simple but crucial observation that facilitates our analysis is that, similar to the classical set- 
ting of second-price auctions, here too, it is a dominant strategy for the bidders to reveal their true 
expected valuations, where the expectation in this context is taken with respect to the conditional 
probability p{j \ a). Therefore, the problem, termed revenue maximization by signaling, reduces 
to finding a signaling scheme that maximizes the expected second-highest bid (amounting to the 
expected revenue). 

Two trivial signaling schemes are the one that reveals no information to the bidders and the one 
^ For the formal exposition of our model, see Section (2] 
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that reveals the actual realization (all the information) . Interestingly, there are instances in which 
an appropriate signaling scheme provides a substantial improvement over the two trivial ones. This 
can be demonstrated already through a special case of a signaling scheme, termed clustering: The 
auctioneer a-priori partitions the set of goods into disjoint clusters, and the signal is the cluster 
that contains the chosen good. Consider the case in which there are m bidders and m types of 
goods, an item is chosen uniformly at random, and each bidder i is only interested in good i with 
a unit valuation. If no information is revealed then the expected revenue is 1/m as the expected 
valuation of each bidder is 1/m. If the actual realization is revealed, no revenue is collected since 
for every realization, the second-highest valuation is 0. However, if the goods are partitioned into 
clusters of size 2, then the expected revenue is 1/2, providing an improvement of a linear factor 
over the best trivial scheme. 

Note that clustering schemes can be thought of as restricting the auctioneer to deterministic 
policies. The class of signaling schemes considered in this paper is more general than clustering as 
we allow the auctioneer to toss coins when deciding on the revealed signal. 

Our Results 

We begin our analysis assuming that the valuations of the bidders are known to the auctioneer. 
In this somewhat less realistic case, the problem of revenue maximization by signaling can be 
formalized as a concise linear program and, as such, solved to optimality in polynomial time. A 
natural question is to what extent the increase in revenue comes at the expense of social welfare. 
Notably, we prove that a signaling scheme that obtains the optimal revenue can preserve at least 
half of the optimal social welfare. In addition, it is shown that if the auctioneer is restricted to 
invoking a signaling scheme by means of clustering, at least half of the optimal revenue can be 
achieved, and this is tight. 

Up until now we have assumed that the valuations of the bidders are known to the auctioneer. 
However, in practice, the auctioneer rarely knows the bidders' valuations. This motivates the 
main technical contribution of this paper, namely, the study of a Bayesian setting, in which the 
auctioneer holds probabilistic knowledge on the bidders' valuations. We show that in this case the 
revenue maximization by signaling problem becomes NP-hard. Still, in several cases of interest 
the problem remains tractable even in the Bayesian setting. Finally, we show that m signals are 
always sufficient to extract the optimal revenue. It is an interesting open problem how to find good 
approximation algorithms for the cases where the revenue maximization is NP-hard or to prove 
hardness of approximation in those cases. 

Notice that our model captures the Bayesian knowledge on behalf of the auctioneer by assuming 
a probability distribution over finitely many valuation matrices. This representation can capture 
complicated dependencies between the different valuations, however it may exhibit plenty of redun- 
dancy when the valuations are assumed to be independent. As such, it will be interesting to study 
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our framework under more concise representations - for example, where each entry of the matrix 
is sampled independently from some distribution. 



Related Work 

There is a rich theory on markets with information asymmetry. In such markets, agents on one 
side have more (or better) information than those on the other side. The foundation of this theory 
dates back to the work of Akerlof, Spence, and Stiglitz on the analysis of markets with asymmetric 
information, which earned them the 2001 Nobel Prize. In particular, Akerlof [1] introduced the 
first formal analysis of markets in which sellers have more information than buyers regarding the 
quality of products. Spence [181 US] demonstrated that in certain settings, well-informed agents 
can improve their outcome by signaling their private information to poorly informed agents. 

There is also a vast literature on the nature and effects of information revelation in auctions. 
One of the most fundamental results in auction theory, namely the "Linkage Principle" of Milgrom 
and Weber [12j, states that the expected revenue of an auctioneer is enhanced when bidders are 
provided with more information. While this work advocates transparency in various markets, later 
work observed that such transparency may not be optimal in general (see, e.g., |17 t l24 t [ ^ [7]). More 
recent work [lllllOj advocated the need for careful grouping of goods as an important market design 
principle. Our work may be viewed as a study of information revelation through an optimization 
lens, since we seek to maximize the expected revenue of an auctioneer by designing an effective 
information revelation scheme. 

Notice that Myerson's classic result on revenue maximization [15] does not apply to our model 
due to the asymmetry of information. Also, Myerson's mechanism works only for single parameter 
settings. Our Bayesian models are multi-parameter and are typically highly correlated. Revenue 
maximization results for correlated valuations [16^ H] also do not apply here due to asymmetric 
information. One could try to reveal all information and then apply one of those mechanisms, but 
we would get a rather contorted auction, with no guarantees against our auction. In fact, it is easy 
to construct examples where this generates arbitrary less revenue than our signaling scheme. Our 
auction, on the other hand, is very practical and close to what is actually implemented in online 
advertising markets. 



Closer to our work is that of Ghosh et al [9], which studies revenue-maximizing clustering 
schemes under a second-price auction in a setting with full information and additive valuations. 
While this setting is different from our framework of signaling in a probabilistic item auction, 
the mathematical formulation of the optimization problem in their setting is a special case of our 
optimization problem, i.e., the case where the valuation matrix is known to the auctioneer and 
the signaling scheme is restricted to take the form of a clustering scheme. Our focus, though, is 
on the more realistic Bayesian case which is not not treated in [9]. In addition, our framework 
relies on signaling that can be viewed as a "fractional" clustering, which is more powerful. Indeed, 
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while Ghosh et al. show that it is strongly NP-hard to compute the optimal clustering scheme, an 
optimal signaling scheme can be computed in polynomial time. Results of the similar flavour of 
the ones in |9J were re-derived independently in a previous version of the current paper [6]. 

Independently of our work, Miltersen and Sheffet [13] also analyze the problem of obtaining 
optimal signaling schemes for revenue maximization using Linear Programming, obtaining a result 
similar to our Theorem 13. 2[ 

2 The Model 

In this section, we introduce the auctioning model on top of which our signaling schemes are 
defined. Our focus in this paper is on a Bayesian setting, treating the uncertainty of the auctioneer 
regarding the bidders' valuations in a probabilistic manner. For clarity of the exposition, we shall 
first consider the (less realistic) known-valuations setting, where no such uncertainty is assumed. 

Known- Valuations Probabilistic Single-Item Auctions 

A known-valuations probabilistic single-item auction (KPSA) A is formally depicted by the four- 
tuple 

A = {n, m,p, V) , 

where n £ Z>o stands for the number of bidders, m € Z>o stands for the number of distinct 
indivisible goods, p G A(m) is a probability distribution over the goods, and V G R^q™ is a non- 
negative real matrix capturing the valuation V{i,j) of bidder i for good j. A single good j € [m] 
is chosen (by nature) according to the distribution p which is a common knowledge. 

The auction is conducted according to the second-price rule: Each player i places her bid hi and 
the chosen good j is sold to the bidder that placed the highest bid maxjg[„]{6j} (ties are broken 
arbitrarily) for the price of the second highest bid max2jg[„]{6j}. 

Signaling Schemes 

Although the bidders know the distribution p, they do not know its actual realization which is 
observed only by the auctioneer. In an attempt to increase her expected revenue, the auctioneer 
may partially reveal the realization j G [m] of p to the bidders. This partial revelation is carried out 
by means of signaling: given that the chosen good is j (recall that this choice is made by nature), 
the auctioneer sends the bidders some signal a; the bidders then hold a "more accurate picture" of 
the chosen good that corresponds to the probability distribution p conditioned on a. The policy 
that dictates the signal that the auctioneer reveals to the bidders for each good j G [m] is referred 
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to as a signaling scheme. It is important to point out that this poUcy is decided by the auctioneer 
and reported to the bidders prior to nature's random choice of item j. 

More formally, a signaling scheme is given by a set of s G Z>o signals and a signaling function 
iS : [s] X [m] — )• [0, 1] that satisfies 



Given that nature chose good j G [m], the auctioneer reveals signal a G [s] to the bidders with 
probability S{a,j). It will be convenient to use the notation S to address the signaling scheme as 
well as its inherent signaling function. 

Once again, it is assumed that S (and s) are decided by the auctioneer and reported to the 
bidders prior to the random choice of j; it is the actual signal a (determined according to S) that 
is revealed to the bidders after the choice of j. (This can be thought of as a commitment of the 
auctioneer to stick to the signaling scheme that it previously reported.) Upon receiving signal a, 
the bidders, knowing p and 5, update their belief from P(chosen good is j) = p{j) to 



For succinctness, we will subsequently denote the events "chosen good is j" and "signal is cr" by j 
and fj, respectively (our intention will be clear from the context). 

Before we proceed, let us consider the restricted variant of a deterministic auctioneer which 
is not allowed to use randomness when determining which signal to reveal. This is equivalent to 
imposing an additional "integrality" requirement on the signaling scheme: S{a,j) G {0, 1} for every 
a G [s] and j G [m]. In other words, each signal a G [s] now corresponds to a cluster C [m] so 
that the clusters are pairwise disjoint and Uo-e[s] ^o- = [w-]- Following this view, the general case 
(under which the auctioneer may use randomness when determining the signal) can be interpreted 
as a fractional clustering of the goods, where S{a,j) is the fraction of good j in cluster C^- 

It is well known that in the classical setting of second-price single- item auctions, it is a dominant 
strategy for the bidders to be truthful, i.e., to bid their true valuations [23]. It turns out that 
this remains valid in probabilistic single-item auctions under signaling as well, as the following 
observation demonstrates (proof deferred to the appendix). 

Observation 2.1. For every i G [n] and a G [s], bidding bi{a) = E,[V{i,j) | a] in response to the 
signal a is a dominant strategy for bidder i. 





P(chosen good is j \ signal is a) 



P(signal is a \ chosen good is j) ■ P(chosen good is j) 



P(signal is a) 

5(fT,j) -pij) 
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Optimization Problems 



Consider some KPSA A = {n,m,p,V) and signaling scheme S. In light of Observation 12.11 we 
subsequently assume that the bidders are indeed truthful, that is, bidder i bids E,[V{i,j) \ a] = 
^je[m]^U I ^) ' ^ihj) in response to the signal a. Therefore, the (expected) revenue of the 
auctioneer, denoted Rev_4(5), is given by 



Rev^(5) = P(o-) • max2ig[„] < 



nj\^)-v{i,j) 



This raises the following optimization problem, referred to as the revenue maximization by signaling 
(RMS) problem: given a KPSA A, construct the signaling scheme S that maximizes Rev_4(iS). One 
may also be interested in the (expected) social welfare resulting from signaling scheme S, defined 
as 

aels] [jeM 
When A is clear form the context, we may omit it from the subscript and write simply Rev (5) 
and SW(5). 

Notice that the revenue of the auctioneer can be rewritten as 



E 

J elm] 



P(a I j) ■ P(j) 



Rev(5) = Y ^(^) ■ max2ig[„] < 
= Y max2ig[„] < 
= Y max2ig[„] I Yj S{a,j) ■ ^'(i,j 



y{h3) 



Y S{a,j)-p{j)-V{i,j) 



(2) 



where ^{i,j) = p{j) ■ V{i,j) is referred to as the normalized valuation of bidder i for item j. 
Following the same line of arguments, we can also rewrite the social welfare as 



Under the deterministic auctioneer requirement, the RMS problem turns into the following 
clustering problem: Given the normalized valuation matrix ^ G devise a pairwise disjoint 

partition of [m] into clusters {Co-}o-e[si that maximizes 



Rev ({C^j^els]) = max2ig[„] < 



ere S 
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A Bayesian Setting 



Recall that up until now, we assumed that the valuations of the bidders are known to the auction- 
eerlfl However, in many practical scenarios the auctioneer does not know the exact valuation of 
each bidder. To tackle this obstacle, we assume a Bayesian setting, treating the state of knowledge 
that the auctioneer holds on the bidders' valuations in a probabilistic manner. This is captured in 
our model by considering k E Z>o distinct valuation matrices Vi, . . . ,Vk € R^q™" and a probability 
distribution q G A(A;) associating each valuation matrix Vi with the probability q{i) that it occurs. 
A probabilistic single-item auction (PSA) is then depicted by the 6-tuple 



where n G Z>o, m G Z>0; and p G A(m) have the same role as in the known- valuations case; and 



The expected revenue of the auctioneer from the signaling scheme S is now defined to be 



where ^i{i,j) = p{j) ■ V£{i,j) and the last equation follows from the same line of arguments that 
was used to establish ([2]). 

3 Optimal Signaling Schemes — The Known- Valuations Case 

Let us start our technical treatment of signaling schemes with (the simpler) known-valuations 
setting, considering a KPSA A = {n,m,p,V). We show that an optimal signaling scheme for A 
can be obtained by solving an LP with O(n^m) variables and 0(n^ + m) constraints (excluding the 
non- negativity constraints). 

Given an s-signal signaling scheme S for A and a signal cr G [s], let hf{a) and /if (c) denote the 
bidders i that realize maxjg[„] \j2je[m]'Si'^^j) ' and max2jg[„] \j2jG[m]'S{'^J) ' ^(«,j)}> 



respectively. (When the signaling scheme S is clear from the context, we may omit the superscripts.) 
Our concise LP relies on the following observation. 

Observation 3.1. There exists an optimal s-signal signaling scheme S for A such that given 
a, a' G [s], ifhf{a) = hf{a') and h^{a) = h'^{a'), then a = a'. 

^ In some sense, we also assumed that the valuations of each bidder are known to the other bidders. However, 
O bser vat ion I2.1l implies that this does not matter: a bidder is better of! bidding its true (expected) valuation regardless 
of the strategies of the other bidders. 




k G Z>o, q G A(A;), and 



Vi G M>Q™ > capture the aforementioned Bayesian angle. 
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Proof. Consider an optimal s-signal signaling scheme <S that minimizes s. We argue that S must 
satisfy the assertion. To that end, assume by contradiction that there are two distinct signals 
a, a' G [s] such that /if (a) = hf{a') = ii and /if(cr) = /if (a') = 12- Let S* be the (s - l)-signal 
signaling scheme obtained from S by replacing both signals a and a' by a new signal a* defined by 
setting S{a*,j) = S{a,j) +S{a',j) for every j G [m]. 

It is easy to verify that S* is valid in terms of ([1]). Moreover, since /if* (a*) = ii and /if* (cr*) = 12, 
we can use ([2]) to conclude that the combined contribution of a and a' to Rev (5) is 

^ (5(a,j)+5(a',j))-*(^2,j) = ^ j) • ^(is, j) 

which is precisely the contribution of a* to Rev(5*). Thus, Rev(5) = Rev(5*), in contradiction 
to the minimality of s. □ 

A direct corollary of Observation 13.11 is that it suffices to consider signaling schemes with 
s = n(n — 1) signals — each signal a is uniquely identified by /ii(cr) and /i2(o'). This turns out to be 
asymptotically tight as there are examples showing that ri(n^) signals are required to implement 
an optimal signaling scheme (see Section [5]) . Based on Observation 13.11 and on the formulation 
of revenue in ([2]), we can construct an optimal signaling scheme S by solving the following linear 
program, denoted LPl: 

max R{ai-^^i^) s.t. 

R{crii,i2) = X] '^('^»i.»2>j) • *(«2, j) Vii,i2 G H,ii / 12 

jG[m] 

X] '5(crii,i2, j) = 1 Vi G H 
S{(Ti-^^i^,j) > Vii,i2 G N,n ^ 'i2,Vj G [ni] . 

Since LPl consists of 0{n^m) variables and 0(n^+m) constraints (excluding the non-negativity 
constraints), it can be solved in polynomial time. 

Theorem 3.2. Under the known-valuations setting, the RMS problem can be solved in polynomial 
time. 

Signaling versus Clustering 

A clustering scheme is a special case of a signaling scheme, where the auctioneer cannot use ran- 
domness (see Section [2]). This restricted case has been studied in [9], and is equivalent to imposing 
the requirement that S{a,j) G {0, 1} for every a and j in our framework. 
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Theorem 3.3. The optimal revenue that can he extracted by a signaling scheme is at most twice 
the optimal revenue that can be extracted by a clustering scheme, and this is tight. 



Proof. The algorithm in [9] produces a clustering scheme that extracts revenue that is greater or 
equal to half of 

miny^max^'(z, j). (3) 

j 

Therefore, in order to establish the upper bound, it is sufficient to show that the revenue extracted 
by any signaling scheme is bounded by Equation ([3]). For every i' E [n], one can express the revenue 
of a signaling scheme S as 

Rev(5) = Y.max2iY,S{a,j)^{i,j) 

< J^^5((T,i)max^'(i,j) 

= Vmax^'(i,j) , 
^ — ' i^i' 
j 

where the last equality holds since X^o- '^('^i ~ ^- '^^^ upper bound follows. 

To establish the lower bound, consider Example 15.31 While the optimal signaling scheme ex- 
tracts revenue j^^ij^, it is not difficult to verify that the optimal clustering scheme partitions items 
1, . . . ,m into pairs and leaves item as a singleton. This clustering scheme extracts revenue 2{n+i) ' 
which is half of the revenue extracted by the optimal signaling scheme. □ 

We remark that while we used Equation ([3]) as our benchmark, a better benchmark would be 
to compare the clustering revenue to the solution of the LP in Theorem 13.21 However, Example [57 
demonstrates that | is tight with respect to this benchmark as well. 



Social Welfare versus Revenue 



Increasing the revenue by signaling usually comes at the expense of degrading the social welfare. 
We show, however, that it is easy to calculate the best revenue one can get without degrading 
the social welfare by much. For every j G [m], let denote the bidder i that maximizes the 
normalized valuation ^{i,j) (which means that i also maximizes V{i,j)). Then, the optimal social 
welfare is given by W* = Ylje[m] -^y augmenting LPl with the constraint 
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we guarantee the highest possible revenue conditioned on preserving at least a /3-fraction of the 
social welfare. Theorem 13.41 (whose proof is deferred to the appendix) shows that taking /3 < 1/2 
does not affect LPl. Note that this theorem can be viewed as a signaling analogue of Theorem 2 
in [9] and its proof essentially follows similar arguments. 

Theorem 3.4. There exists a revenue- optimal signaling scheme S with SW(cS) > W* /2. 

4 Optimal Signaling Schemes — The Bayesian Case 

We now turn to discuss the more interesting Bayesian setting, considering a PSA A = (n, m, k,p, q, {Vi}^^^;^]^, 
where g is a probability distribution over the valuation matrices Vi, . . . ,Vk € M"q™. Our goal in this 
section is twofold: (1) proving that the RMS problem under the Bayesian setting is NP-hard; and 
(2) presenting poly-time algorithms when k oi m are fixed. Note that the RMS problem remains 
NP-hard if n is fixed as long as both k and m are free parameters. 

Tractable Special Cases 

Let us start with developing an efficient algorithm for the RMS problem assuming that k = 0(1) 
(without any restriction on n or m). Consider some s-signal signaling scheme S for A. Given 
a Bayesian outcome I € [k] and a signal a € [s], let hf{£,a) and /if(£, cr) denote the bidders 
i that realize maxjg[„] [j2je[m] '^icrj) ■ ^£(«, j)} and max2ig[„] jEjeM •S{cr,j) ■ ^^(i,i)}, respec- 
tively. (When the signaling scheme S is clear from the context, we may omit the superscripts.) 
Using this notation, we can now state the following observation which is established by repeating 
the line of arguments that led to Observation 13. li 

Observation 4.1. There exists an optimal s-signal signaling scheme S for A such that given 
a, a' e [s], ifhf{£,a) = hf{i,a') and h^{l,a) = h^{l,a') for every I G [fc], then a = a' . 

Observation 14. II implies that it is sufficient to consider 0{v?^) signals cr, each uniquely identified 
by /ii(l, cr), /i2(l, ct), . . . , h\(k^ a), h2{k, a). In order to formulate it as a concise linear program, we 
fix 

A = {{{i\,il),...,{ii,4)) I ii e N G {1,2}, e G [k] A ii ^ i'^Ml G [A;]} . 
An optimal signaling scheme S can now be constructed by solving the following linear program. 
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denoted LP2: 

max Rjc^x) s.t. 
agA 

ee[k] j&[m] 

R{ax) = ^g(^) ^ 5(aA,j)-^£(A(£,2),j) VAeA 

J]5(aA,j) = 1 VjG[m] 
AeA 

5(aA,j) > VAg A,Vj G H . 

Since LP2 consists of 0{v?^m) variables and 0{'n?^ + m) constraints (excluding the non- 
negativity constraints), it can be solved in polynomial time as long as k is constant. 
Theorem 4.2. If k is fixed, then the RMS problem can he solved in polynomial time. 

Next, we show how to compute an optimal s-signal signaling scheme S when m = 0(1) (without 
any restriction on k and n). The main ingredient for this will be the following lemma (refer to |20] 
for a proof). 

Lemma 4.3. The number of distinct regions with non-empty interio^ defined byt>m hyperplanes 
in R™" is bounded from above by the Whitney number W{m,t) = YlT^o (i) — 0(t"^). 

Given some A G A, we define Xx to be the region that contains every vector x G M™q — {0} such 
that 

^ x{j) ■ l),i) > ■ (^(^'2), j) > ■ (^'•?') (4) 

jG[m] je[m] jeH 

for every I G [k] and i ^ {A(^, 1), 2)}. The key observation here is that if two signals a, a' G [s\ 
are such that their corresponding vectors S{a, •),5((7, •) G M>o ^^^^ ™to the same region Xx^ A G A, 
then we can merge them without decreasing the revenue. Therefore, if we can come up with a 
poly-size subset A' C A so that the regions in {Xx | A G A'} cover the entire M'>q — {0}, then we 
can construct an optimal signaling scheme by picking one signal (the right one) for each region Xx 
such that A G A'. 

So, how can we come up with such a subset A' C A? It turns out that although there are many 
regions Xx, only a polynomially small subset of them have a non-empty interior. Indeed, the total 
number of linear constraints ([4]) involved in the definition of the regions Xx, A G A, is n^/c (each 
linear constraint is of the form Yl,je[m] ^0) ' ^^(^i j) ^ Yl,je[m] ^0) ' ^^(^'' j) fo'^ some i, i' G [n] and 
(. G [k]). Since those linear constraints correspond to hyperplanes in M™, Lemma 14.31 guarantees 
that there are 0{{n'^k)"^) regions Xx with a non-empty interior. 

Once the subset A' C A of regions with non-empty interior has been identified, providing the 
linear constraints of (jH) for each such region, we can rewrite LP2, dedicating a single signal a to 
* A region R £ R™ is said to have a non-empty interior if it contains an m-dimensional open set. 
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each region X\ such that A G A' (the vector 5((T, •) takes the role of the vector x in (jH). The 
resulting linear program consists of 0([A'[ • m) variables and 0(|A'| • kn) constraints, thus it can be 
solved in polynomial time. 

It remains to show that we can efficiently enumerate the collection of regions with non-empty 
interiors. This is carried out by recursion on k: When = 1, we can simply iterate through all the 
regions and check if their interior is non-empty. For the recursive step, observe that if the region 
X\ has an empty interior, then clearly, so does the region X^^^.^+i .fc+i^ for every z^"*""*^, Zg^"*^ € [n]. 
Therefore, we can keep iterating only through those regions that had a non-empty interior in the 
previous recursive level. By Lemma [4.3^ the whole process requires checking n(n— l)-0((n^(/c— 1))"*) 
regions. 

Theorem 4.4. If m is fixed, then the RMS problem can he solved in polynomial time. 



Hardness of the General Case 



Finally, we establish the NP-completeness of (the decision version of) the RMS problem in the 
Bayesian setting for the case n = 3. The inclusion of this problem in NP follows from Theorem 15.21 
that ensures that it suffices to consider signaling schemes with at most m signals (which also implies 
that the number of bits required to represent the solutions of the LP is polynomial) . 

The remainder of the section is dedicated to proving that the RMS problem in the Bayesian 
setting is NP-hard. This is done by a reduction from MAX-CUT (problem ND16 in [8j): Given 
a graph G = (y, E) and two vertices x, y G V ^ the MAX-CUT problem asks for the maximum 
integer k such that there exists a vertex subset C/ C y, PI ?7| = 1, with at least k edges 

crossing between U and V — U . Given such an instance of MAX-CUT, assuming that |y| = n 
and \E\ = m, we construct a PSA A with 3 bidders, n items (associated with the vertices in V), 
and 2n -|- m — 3 Bayesian outcomes. It will be convenient to associate the Bayesian outcomes as 
follows: each 2<^<n — lis associated with some vertex u V — {x, y}; each n < £ < 2n — 3 is 
also associated with some vertex u G V — {x, y}; each 2n — 2<£<2n-|-m — 3is associated with 
some edge {u,v) G E. Figure [T] depicts the values of = q{£) ■ ^'^(i,j) for every i £ {1,2,3}, 

j G [n], and i G [2n + m — 3], where Ki ^ K2 S> 1 are integers that will be determined in the 
course of the proof. This specifies everything we need for the reduction. 

Suppose that the solution to the MAX-CUT instance is C* and that this is realized by the 

vertex subset X C U, where x & X and y (z Y = V — X . We design a signaling scheme S with 

• 1 . . ^ f 1 ifuGX , , f ifuG X ^ 

two signals crx,cry such that o{ax,u) = < and o{ay,u) = < .It can be 

\ if u £Y IlifnGy 

checked that the revenue generated by this signaling scheme is 

J2 Yl max2ie{i,2,3} I •'^(^'") [ = 2Ki + (n - 2)i^2 + m + . 

eel2n+m-3]cTe{aj:,ay} Ku&V ) 
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Figure 1: Representation of the mapping from MAX-CUT to Bayesian signaling. The tables 
represent for v and i € {1,2,3}. Values not specified in the tables are zero. 

The reduction is completed by showing that this is an upper bound on the revenue generated by 
any signaling scheme. 

Consider some s-signal signaling scheme S. The revenue of S is 

Rev(5) = X] ™ax2ig|i^2,3} \ u) ■ S{(t, u) \ 

^e[2n+m-3] crG[s] KueV ) 

= max{5((T,x),5(o-,y)} 

+ K2 ^ ^ [min{5((T,x),5((T,u)} +min{5(cr,y),5(cr,n)}] 

u&V—{x,y} (Te[s] 

+ X] X] ™^^"L'5(o",u),5(cr,v)} . (5) 

We argue that if S is optimal, then for each signal a, either S{(t,x) = or S{a,y) = 0. Indeed, if 
there is a signal a with S{a,x) and S{a,y) simultaneously positive, then we can split this signal 
into two signals a', a" such that S{a',x) = S{a,x) and S{a',u) = for n 7^ x; S{a",x) = and 
S{a",u) = S{a,u) for x. Taking Ki to be sufficiently large ensures that the revenue increases 
following this transformation. 

Now, consider a signal a £ [s] with S{a, x) > 0. If there exists some u ^ x such that S{a, u) > 
<S(cj, x), then there must exists a signal a' E [s] such that S{a',u) < S{a',x) since J2ae[s] <S{fT,x) = 
X](tg[s] '^('^' ~ ^- Therefore, we can increase the value of S{a',u) and decrease the value of 
S{a, u) by a small value 6, obtaining a valid signaling scheme with larger revenue (this is due to the 
fact that K2 is large compared to 1). Similarly, we can claim that in an optimal signaling scheme, a 
signal a G [s] with iS(c7, y) > has S{a, u) < S{(7, y) for all u € V. The same argument also implies 
that for each signal a that has positive probability, either S{a, x) > or S{a, y) > 0. 

Consider a signal a and let x = uq,ui, . . . ,Uk E F be the items with positive S{a, Ui). Assume 
without loss of generality that 5((T, uq) > S{a,ui) > ••• > S{a,Uk) > S{a,Uk+i) = 0. Now, 
split signal it in A; + 1 signals a^,a^, . . . ,a'' such that 5((T*,s) = S{a^,ui) = ■■■ = S{a'^,Ui) = 
S{a,Ui) — iS(cT, Uj+i). By substituting the split signals into ([S]), we conclude that the revenue is 
kept unchanged. 
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Therefore, PSA instances produced by our reduction always admit an optimal s-signal signaling 
scheme S such that for every signal cr G [s], there exist a vertex subset QV and a real < Po- < 1 
satisfying: 

(1) \U^n{x,y}\ = l; 

(2) S{a, u) = Pa for every u £ 17^; 

(3) 5(cT, n) = for every u ^ 11^; 

(4) J2a:veu„ Pa = 1 for every u G F; and 

(5) EaeW^^<^ ^ E,7:xe!7<. ^"-^ + T,a:yeUaPa = 2- This is employed in order to prove that Rev(5) < 
2Ki+{n-2)K2+m+C*. Prom (P, we see that Rev(5) = 2Ki+{n-2)K2+J2(u,v)&E E^gW max{5(fT, u),S{a, v)}, 
so it remains to show that 



E 



{u,v)eE crG[s' 



max{5((T, ti), 5((7, u)} < m + C* . 



(6) 



To that end, note that every edge (n, v) £ E and signal a E [s] contribute to the the left-hand 
side in ([6]) if llu,^} H C/o-| > 1; and otherwise. Therefore, the left-hand side in ^ is equal to 



{u,v)£E 



1 + (1/2) 

o-:|{u,i;}nl/„| = l 



where (?(C/) is the set of edges with exactly one endpoint in U, and the last inequality follows since 
E(tg[s] ^ ~ 1' hence J2ae[s] ^l^(^o-)| can be viewed as the average size of cuts corresponding to 
the vertex subsets Ua, cr £ [s]. This establishes the following theorem. 

Theorem 4.5. The decision version of the RMS problem in the Bayesian setting is NP-complete. 
The problem is hard already for n = 3. 



5 Bounding the Number of Signals 

In a market with n bidders, the algorithm described in Section [3] generates a signaling scheme with 
0{n?) signals. In fact, some instances might require that many signals in order to produce the 
optimal signaling. 

Example 5.1. Consider a KPSA with — n items appearing with uniform probability. Each 
item Jj^j/ is labeled with an ordered pair of bidders, where V{i,Ii^i') = 1, V{i',Ii^i') = ^, and 

V{i", li^i') = for every i" G [n] — {i, i'}. The optimal signaling generates revenue of |, by emitting 
a signal Cj^j' when either /j^j/ or /j/^j is chosen. Notice that s = (2) signals are required to implement 
this signaling scheme. It is not hard to see that with fewer signals, it is impossible to achieve this 
revenue. 

The KPSA described in Example 15.11 has m = r2(n^) items. Can we construct a similar bad 
example with much fewer items? More generally, can we bound the minimum number of signals 
required to implement an optimal signaling scheme as a function depending only on m? 
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Theorem 5.2. Every PSA has an optimal signaling scheme with m signals. 



Proof. For brevity, we establish the assertion assuming the known-valuations setting; the proof of 
the Bayesian setting follows from the same line of arguments. Consider some KPSA A = (n, m, p, V) 
and let S be an s-signal signaling scheme for A that minimizes s. Assume by contradiction that 
s > m. Associate with each signal fj G [s] a vector q^,. G IR>o, where Qaij) = '5(cr, j) for every j € [m]. 
We can rewrite the contribution of a to the revenue of S as Rev((7, 5) = max2jgj„]{^^gj^] qa{j) • 
so Rev(-,5) is a homogeneous, but not necessarily linear, operator. 

Since s > m, the vector collection {go- | a € [s]} must exhibit linear dependencies. Therefore, 
there must exist some reals xi, . . . , Xr, Xr+i, . . . ,xt > and signals ai, . . . a^, • • • , i^t such that 



Xiqai H y- Xrqur = Xr+iqar+i H 1" Xtqat ■ 

Consider the s-signal signaling scheme «S' defined by the modified signals a' obtained from S by 
setting 

(1 + exz)qa:, iil<z<r 
qa'^ = < (1 - exz)qa^ ifr + l<2;<i 
qcr^ otherwise . 



1 1 

1 and Qrri > 0. 



the resulting signaling scheme S' is valid as X^^^/gr^i qa' = 



For any e G 
J2ae[s] = 

Now, notice that the revenue of the new signaling scheme satisfies 

' r t 

Rev(5') = Rev(<S) + e y~]a;zRev(q-^,5) — ^ Xz'Rev{az,S) 

.2=1 2=r+l 

Since S is optimal, it must be the case that Y^z=i Xz^ev(az,S) — XzRev{az,S) = (recall 

that e can be taken to be positive or negative). Thus, we can take e to be either — ; — r or 

and get an optimal signaling scheme with less than s signals, in contradiction to the 

□ 



maxr+i<2<t{xz} 

choice of S. 



If one can achieve revenue R with s signals, then it is trivial to see that with s' < s signals, one 



can achieve revenue 



R. This turns out to be the best possible in some cases. 
Example 5.3. Consider n + 1 items {0, 1, . . . each chosen with probability and n + 1 
bidders with valuations V{i,i) = 1 for i = I, . . . ,n, ^(0,0) = n, and V{i,j) = otherwise. The 
optimal signaling scheme S uses n signals ai, ...,an such that S{ai,i) = 1 and 5((7j,0) = for 
i = 1, . . . ,n and zero otherwise. The revenue of this scheme is Now, for any s < n and an 
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s-signal signaling scheme <S', we have 



Rev(5') = J2 < 

a'els] 



5 E 



maxje[„]_{0} < 



< E — 

- ^ n + 1 



a'els] 



+ 1 + 1 
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APPENDIX 



Proof of Observation \2.1\ Consider the known- valuations ex-ante game defined by setting the strat- 
egy space of each bidder i € [n] to be the collection of all possible functions hi : [s\ M>o and the 
utility of bidder i from strategy profile h = (61, ... , 6„) to be the expected utility of bidder i in the 
KPSA A assuming that each bidder adheres to h. Fix some (arbitrary) strategies 6j' : [s] — )• M>o 
for all bidders i' ^ i and consider the strategy hi of bidder i that bids 

in response to each signal fi G [s\. 

Fix some signal a. From bidder i's point of view, the expected valuation of the chosen good is 
hi{a), whereas each other bidder i' ^ i, bids hi'{sigma). If bidder i does not win the chosen good, 
which happens only if maxj/^j{6j/((T)} > hi{a), then her utility in the ex-ante game is 0. This can 
be changed only if bidder i increases her bid so that it exceeds maxj/^j{6j' (cr)}, but this imposes 
a negative utility on i. So, assume that maxj'^j{6j'(cr)} < hi{a) and bidder i does win the chosen 
good. By the definition of the second-price rule, the utility of i must be non-negative. Clearly, 
bidder i has no incentive to increase her bid. Decreasing her bid does not change her utility as 
long as it still exceeds maxjYj{^j'('^)}j decreasing her bid further resets her utility to zero. The 
assertion follows. □ 

Proof of Theorem \3.4\ Consider an optimal signaling scheme and let cr be a signal and j an item 
such that 5(cr,j) > 0. If fi{j) ^ {/if (a), /if (cj)}, then we construct the new signaling scheme S 
obtained from S by replacing signal cr with the two new signals cr', a" such that S{a',j) = S{a,j), 
S{a',j') = for / ^ j; and S{a",j) = 0, S{a",f) = S{a,j') for f ^ j. 

We argue that Rev(5) > Rev(5) which, by the optimality of S, implies that Rev(iS) = 
Rev(5). To that end, note that the contribution of cr to Rev(5) is 

Rev(cT,5) = 5(c7,i).M/(/if(a),j) + J]5(a,/)-^(/if(^),/) 

< Sia,j)-^ihfia),j) + ^Sia,j')-^ihfia)j') , 

whereas the contributions of a' and a" to Rev(5) are 

Rev(cr',5) = 5(cr, j) • max2ie[„] {^'(i, j)} 

and 

Rev(cr",5) = max2jg[„] 



^(^,j') 



i 



respectively. The argument follows since 

Rev(a',5) > msx{S{a,j) ■ ^{hiia), j),Sia, j) ■ ^'(/if (a), j)} 

and 

Rev(c7",5) > mm\^Siaj')-^{hiia),j'),^Sia,f)-^{hf{a),j') 

It follows that there exists a revenue-optimal signaling scheme S such that S{a,j) > only if 
^ {hi{a), h2{cr)}. Therefore, the social welfare of <S satisfies 

SW(5) = E E S{a,j)-^{hr{a),j) 

ae[s]je[m] 

> ^ ^ 5(a j) . ^(^i(^)'j) + '^(^2(cT),j) 

ae[s] je[m] 

^ ^E E'^C'^'^')-*^^')'.?) = l^*- 

(Te[s] je[m] 

The assertion follows. □ 
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